Security News
Bun 1.2 Released with 90% Node.js Compatibility and Built-in S3 Object Support
Bun 1.2 enhances its JavaScript runtime with 90% Node.js compatibility, built-in S3 and Postgres support, HTML Imports, and faster, cloud-first performance.
@d3fc/d3fc-sample
Advanced tools
A data subsampler that chooses representative data points using a largest triangle or mode-median approach.
A D3 component for down-sampling data using a variety of methods. Data is typically partitioned into equally-sized buckets, and one data point from each bucket is chosen. The algorithms employed here are Largest Triangle 1 (or 3) Bucket, or Mode-Median as detailed in the thesis Downsampling Time Series for Visual Representation.
For a live demo, see the GitHub Pages site.
npm install @d3fc/d3fc-sample
The sampling components provide an API for downsampling data. They are typically used to improve rendering performance of charts or maps when there is a significant amount of data. The data is passed to the component, which then returns a smaller downsampled array.
import {largestTriangleThreeBucket} from 'd3fc-sample';
// Create the sampler
const sampler = largestTriangleThreeBucket();
// Configure the x / y value accessors
sampler.x(d => d.x)
.y(d => d.y);
// Configure the size of the buckets used to downsample the data.
sampler.bucketSize(10);
// Run the sampler
const sampledData = sampler(data);
The mode-median sampler is fairly basic. It partitions the data, then selects a representative piece of data from that set: the mode -- if it exists -- or the median.
# fc.modeMedian()
Constructs a new sampler.
# modeMedian(data)
Runs the sampler. It returns the downsampled data (it doesn't modify the data
array itself). The downsampler selects the mode (if it exists), or the median value.
# modeMedian.value(accessorFunc)
Specifies the accessor function used to obtain the value from the supplied array of data. The accessor function is invoked exactly once per datum, and should return the value to be down-sampled.
# modeMedian.bucketSize(size)
Denotes the amount of data points for each bucket. The first and last data points are always their own bucket. The second-last bucket will be of size (data.length - 2) % size
.
Largest Triangle is a sampler where, given two pre-determined points, the point in the bucket that forms the largest triangle has the largest effective area and so is the most important in the bucket. The largest triangle algorithm comes in two flavours -- one bucket and three bucket. The algorithms are described in detail in the thesis Downsampling Time Series for Visual Representation, with a summary of their descriptions reproduced here:
This algorithm is very simple. First all the points are ranked by calculating their effective areas. Points with effective areas as null are excluded. The data points are then split up into approximately equal number of buckets as the specified downsample threshold. Finally, one point with the highest rank (largest effective area) is selected to represent each bucket in the downsampled data.
The effective area of a point is the area size of a triangle it forms with its two adjacent points.
# fc.largestTriangleOneBucket()
Constructs a new sampler.
#largestTriangle(data)
Runs the sampler, returning the downsampled data (it doesn't modify the data
array itself). The sampler selects the point in the bucket with the largest area between two other points (determined by algorithm).
# largestTriangle.x(accessorFunc)
# largestTriangle.y(accessorFunc)
Specifies the accessor function used to obtain the x and y values from the supplied array of data. The accessor function is invoked exactly once per datum, and should return the value to be down-sampled.
# largestTriangle.bucketSize(size)
Denotes the amount of data points for each bucket. The first and last data points are always their own bucket. The second-last bucket will be of size (data.length - 2) % size
.
The algorithm is similar to the above, except it works with three buckets at a time and proceeds from left to right. The first point which forms the left corner of the triangle (the effective area) is always fixed as the point that was previously selected and one of the points in the middle bucket shall be selected now.
The point used to form the triangle in the last bucket is a temporary point which is the average of all other points within that bucket.
# fc.largestTriangleThreeBucket()
Constructs a new sampler.
# largestTriangle(data)
Runs the sampler, returning the downsampled data (it doesn't modify the data
array itself). The sampler selects the point in the bucket with the largest area between two other points (determined by algorithm).
# largestTriangle.x(accessorFunc)
# largestTriangle.y(accessorFunc)
Specifies the accessor function used to obtain the x and y values from the supplied array of data. The accessor function is invoked exactly once per datum, and should return the value to be down-sampled.
# largestTriangle.bucketSize(size)
Denotes the amount of data points for each bucket. The first and last data points are always their own bucket. The second-last bucket will be of size (data.length - 2) % size
.
d3fc-sample also comes with a data bucket utility, used by the algorithms. It partitions data into evenly-sized chunks.
# fc.bucket()
Construct a data bucket utility instance.
# bucket(data)
Partitions the data into evenly sized buckets, in the form:
// where n is the bucket size
[
[data[0], data[1], ..., data[n - 1]],
[data[n], data[n + 1], ..., data[2n - 1]],
...
[..., data[data.length - 1]]
]
# bucket.bucketSize(size)
Denotes the amount of data points for each bucket. The last bucket will be of size data.length % size
.
FAQs
A data subsampler that chooses representative data points using a largest triangle or mode-median approach.
The npm package @d3fc/d3fc-sample receives a total of 4,329 weekly downloads. As such, @d3fc/d3fc-sample popularity was classified as popular.
We found that @d3fc/d3fc-sample demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 3 open source maintainers collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Bun 1.2 enhances its JavaScript runtime with 90% Node.js compatibility, built-in S3 and Postgres support, HTML Imports, and faster, cloud-first performance.
Security News
Biden's executive order pushes for AI-driven cybersecurity, software supply chain transparency, and stronger protections for federal and open source systems.
Security News
Fluent Assertions is facing backlash after dropping the Apache license for a commercial model, leaving users blindsided and questioning contributor rights.